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SUMMARY 


The  compound  Poisson  process  is  a  useful  model  for  describing  total  claim  costs  in  the 
insurance  industry.  In  this  investigation,  the  compound  Poisson  model  is  modified  to  allow 
dependence  among  the  compounding  variables  in  an  effort  to  more  accurately  model  situations  in 
which  successive  claim  awards  are  correlated  and/or  form  a  nonstationary  process.  Specifically, 
the  sequence  is  assumed  to  follow  an  ARIMA(p,d,q)  model.  This  type  of  model  is  illustrated 
with  the  use  of  actual  asbestosis  claim  cost  data  collected  from  Naval  shipyards. 
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POISSON  COMPOUNDING  OF  DEPENDENT  RANDOM  VARIABLES: 

A  STOCHASTIC  MODEL  FOR  TOTAL  CLAIM  COSTS 

JOHN  E.  ANGUS 

Department  of  Mathematics 
The  Claremont  Graduate  School 
143  E.  Tenth  Street 
Claremont,  CA  91711 

1.  Introduction  and  Preliminary  Development 
On  a  fixed  probability  space  (O,  3,  P),  let  {N^;  T  ^  0}  be  a  nonhomogeneous  Poisson 
process  with  mean  value  function  EN^  s  m(T),  T  >  0,  satisfying  m(0)  =  0,  m(»)  is 
nondecreasing,  m(T)  T  <»  as  T->«>o,  and  let  {X^,  t  ^  1}  be  a  sequence  of  random  variables, 
independent  of  the  Poisson  process.  When  the  XjS  are  independent  and  identically  distributed 
(iid),  the  process  {Cj,,  T  S  0}  defined  by 

J  (') 

(where  a  summation  with  upper  index  0  is  defined  to  be  0)  is  called  a  Compound  Poisson 
Process  (CPP).  The  CPP  is  used  extensively  in  modeling  the  total  claim  costs  in  insurance  risk 
analysis.  See  Prabhu  (1980),  for  example.  In  this  context,  claims  are  generated  according  to  a 
Poisson  process,  and  successive  claim  awards  are  represented  by  the  X^s.  When  applied  to 

specific  types  of  claims,  especially  those  involving  bodily  injury,  the  CPP  may  no  longer  be  an 
accurate  portrayal  of  the  process  because  the  sequence  of  X^s  may  be  autocoirelated  and  /  or 

nonstationary.  This  could  be  caused,  for  example,  by  the  effects  of  legal  precedent  established  in 
adjudicating  the  claims,  economic  trends,  and  so  on. 

To  account  for  possible  trends  and  autocorrelation  within  the  sequence  of  X-s,  the  CPP 
can  be  modified  by  assuming  that  {X^,  t  ^  1-d}  is  an  ARIMA(p,  d,  q)  process.  That  is, 
introducing  the  back-shift  operator  B  (B^Xj=Xj_j)  and  difference  operator  V  (V®Xj=Xj,  VX^  = 
(l-B)Xj  =  Xj-Xj_  j,  V-jx  =V(vj‘^Xj),  j  ^  1 ),  it  is  assumed  that  for  t  ^  1, 

V^Xj  =  ii  +  Yj  (2) 
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where  {Y^}  s  {Y^,  t  =  0,  ±1, ...  )  is  a  zero-mean  causal  ARMA(p,  q),  and  is  a  constant.  This 
in  turn  means  that  {Y^j  is  a  zero-mean  second  order  stationary  sequence  that  satisfies  a  set  of 
difference  equations  of  the  form 

(p(B)Yj  =  e(B)Z^  (3) 

where  (p(B)  =  1  -  (pjB  -  -  ...  -  <p  fiP  e(B)  =  1  +  BjB  -t-  G^B^  -t-  ...  -t-  0  B^^,  and  the 

polynomial  (p(«),  when  treated  as  a  function  of  a  complex  variable  z,  has  no  roots  inside  or  on  the 
unit  circle  in  the  complex  plane.  The  sequence  {Z^}  is  assumed  to  be  a  white  noise  sequence, 

that  is,  EZ=0,  and  EZ^Z^  =  a^5(t  -  s)  for  all  integers  s  and  t,  where  5(k)  =  0  if  k  0,  and  5(0)  = 

1.  This  is  usually  denoted  by  {Y^}-ARMA(p,  q),  and  {Z^}-WN(0,  a^). 

There  are,  of  course,  infinite  choices  for  the  process  {X^}  that  would  account  for 

autocorrelation  and  /  or  nonstationarity.  The  ARIMA  choice  is  particularly  attractive  for  many 
reasons.  For  example,  it  is  known  that  ARMA  (p,  q)  process,  the  d  =  0  case,  can  approximate  a 
broad  class  of  stationary  processes  whose  autocorrelation  functions  approach  0  as  the  lag 
approaches  oo.  In  addition,  the  ARIMA  process  describes  processes  that  possess  a  deterministic 
or  stochastic  polynomial  trend,  as  well  as  explosive  -  type  nonstationarity  (as  exemplified  by  the 
simple  random  walk  model).  Finally,  statistical  techniques  for  identifying,  estimating, 
diagnosing,  and  forecasting  ARIMA  models  are  well-developed,  mature  (see  Box  and  Jenkins, 
(1970),  Priestley  (1981),  or  Brockwell  and  Davis  (1992),  for  example),  and  highly  automated  in 
popular  statistical  computing  packages,  making  them  imminently  practical  for  this  application 
and  others  in  which  dependent  sequences  must  be  modeled. 

To  demonstrate  the  plausibility  of  this  approach  to  modeling  actual  claim  costs,  221 
successive  asbestosis  claims  collected  from  US  Naval  shipyards  were  compiled.  These  are  listed 
in  Table  2  of  the  Appendix.  These  are  plotted  in  Figure  2.  Figures  3  and  4  show,  respectively, 
the  estimated  autocorrelation  and  partial  autocorrelation  fiinctions  of  the  series  after  differencing 
once  at  lag  1.  These  suggest  that  perhaps  an  MA(1),  AR(5)  or  mixed  model  would  be 
appropriate  to  describe  the  differenced  series.  Figure  5  shows  the  estimated  autocorrelation 
function  of  the  estimated  residuals  from  a  fitted  ARIMA(0,  1,  1)  model  for  the  original  series. 
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which  yield  a  Portmanteau  test  on  the  first  20  values  of  15.01  on  19  degrees  of  freedom,  which 
appears  consistent  with  whiteness  of  the  residuals  and  adequacy  of  the  fit.  The  fitted  model  is 
Xj  =  +  244.13  +  Zj  -  .93  {Z^j-WNfO,  6.04  x  10^). 

Figure  1  is  a  plot  of  cumulative  number  of  asbestosis  claims  versus  time  reported  from  the  same 
Naval  Shipyards.  These  are  not  in  one-to-one  correspondence  with  the  claim  amounts  in  Table  2 
because  not  all  claims  had  been  adjudicated.  Figure  1  also  contains  a  fitted  5th  degree 
polynomial  approximating  the  mean  value  function  of  a  postulated  underlying  nonhomogeneous 
Poisson  process.  Finally,  Table  1  gives  observed  counts  versus  expected,  computed  from  the 
postulated  nonhomogeneous  Poisson  model,  for  randomly  selected  time  intervals.  Again,  the 
data  appear  consistent  with  a  nonhomogeneous  Poisson  process. 

Without  further  assumptions  on  the  sequence  {Z^),  distribution  theory  for  {Cj.}  cannot 

be  addressed.  However,  assuming  only  that  {Z^j-IIDfO,  cP"),  asymptotic  distributions  for  Cp 

for  large  T,  can  be  developed.  This  is  the  subject  of  the  next  section.  The  final  section  discusses 
how  the  asymptotic  results  may  be  used  in  practice. 


2.  Asymptotic  Distribution  theory  for  C  j 


By  the  assumptions  on  m(T)  =  ENp  asymptotic  distribution  theory  for  Cj.  depends  on 
the  following  results  from  Angus  (1992).  Let  {X^,  t=l-d,  2-d, ...  0, 1, 2, ...}  be  an  ARIMA(p,d,q) 
process,  satisfying  V^Xj=Yj+n  where  (Y^)  is  a  causal  ARMA(p,q)  process  with 
{Z^}-IID(0,a^).  Define  the  usual  operation  of  generating  factorial  polynomials  by 

(k-i+1)  =  k(k-l)...(k-j+l)  for  integers  j  and  k  with  j>0,  and  k^^^sl.  Thus,  with  k  treated  as  a 


variable,  it  follows  that  for  j^, 

V  (k+l)^'^^V[(j+l)!]  =  k^Vj!.  (4) 

and 


The  main  results  from  Angus  (1992)  are  the  following  theorem  and  corollary,  which  hold  even 
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for  the  case  d=0  with  the  convention  that  summations  with  upper  limit  0  are  taken  to  be  0.  The 
symbol  denotes  convergence  in  distribution. 

Theorem  1.  Let  YyC*)  ^  autocorrelation  function  of  and  suppose  that 

Then,  as  n-^oo,  the  distribution  of 

[yy(0)  lS=i[(v-Kl-l/'^^]^  +  2 
converges  to  that  of  N(0,1). 

Corollary  1.  Under  the  conditions  of  Theorem  1,  n’^^’^£j|_j(x^-4(t+d-l)^‘^Vd!)  — > 
2 

N(0, 0)  )  as  n-^oo,  where 

co^=(2d+l)-'d!-^YY(0)+2I^^,YY(h)].  (7) 

The  proofs  of  these  are  given  in  Angus  (1992),  and  make  use  of  the  following  Lemma,  also 
proved  in  Angus  (1992),  that  will  be  of  further  use  in  the  present  discussion. 

Lemma  1.  Suppose  that  V^Xj=Yj+^i,  ^1.  Then  for  t^l,  X^  can  be  expressed  as 

^  (d-D!  ® 


and  for  n>l 


,  n(n+d)W*’> 


^t=l  t  ^i=0  0  (i+1)! 


(d+1)! 


(v+d-1)^ 
v+1  ^ 


The  next  theorem  gives  the  asymptotic  distribution  of  Cj,  as  T-»«».  Here,  the  following 

notation  will  be  used.  For  sequences  a„  and  b  >0,  T„  is  AN(a  ,  b  )  means  that 

^  n  n  n  n  n 


^  “  (2Tt)'^'^^J*^exp(-x^/2)dx 


for  all  real  t,  where  n  may  be  a  discrete  or  continuous  variable,  and  the  limit  is  taken  as  n-^  c 
where  c  is  real  or  ±oo. 

Theorem  2.  Let  (X^)  in  (1)  be  an  ARIMA(p,  d,  q)  as  in  Theorem  1,  and  assume  the 
conditions  and  notation  given  there.  Then,  as  T->«»,  Cp  is  AN(Vp  t^)  where 
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(10) 


(d+1)! 


^ = 53?55;RV0) + 


From  Lemma  1,  by  conditioning  on  {Xj_j,  ^2-dJ  ""  ^0^’  following  corollary  is 


immediate. 

Corollary  2.  Under  the  conditions  of  Theorem  1,  if  d  ^  1,  then  conditioned  on  X, 

2 

....  Xq},  Cj.  is  AN(Vp  t^)  where 

T  A=0  0^  (i+1)!  (d+1)! 

2 

and  Tj  IS  given  m  (1 1). 


(d+1)! 


Another  useful  corollary  is  the  special  case  d=0,  which  indicates  that  the  process  (X^)  is 
a  causal  ARMA(p,  q)  process  with  mean  |i,  having  autocovariance  function  Yy(h)  =  cov(X^, 


Corollary  3.  Under  the  conditions  of  Theorem  2  with  d=0,  as  T-^o®,  Cj.  is  AN(VpXlp) 


where 


Vy  =  p  m(T) 


4  =  m(T)(yY(0)+2i:~  jYyCh)  +\i^) 


Proof  of  Theorem  2.  From  (9),  define  the  polynomial  p(n)  with  random  coefficients  by 

p(n)  s  ,  Then  with  v™  and  as  in  (10)  and  (1 1)  it 

^i=0^  0'  (i+i)!  (d+1)!  T  T 


follows  that 


<V'’t  P(Nt)-Vi 
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By  independence  of  the  processes  {X^}  and  (N^),  the  two  random  variables  on  the  right  side  of 

(15)  are  uncorrelated.  By  Theorem  1  and  Corollary  1,  the  first  term  on  the  right  side  of  (15) 
converges  in  distribution  to 


r  ^ 

N,(0,1)  - 

^  Ly^(O)  + 


YY(0)+2lh=i'yY(h)  '|l/2 


Yy(0)  +  2  Ih=iYY(h)  +  \i\2d+n 
and  by  a  first  order  Taylor  expansion  of  the  function  f(x)  =  (x+d)^^^^\  the  second  term  on  the 


11/2 


right  side  of  (15)  converges  in  distribution  to 

_ r  nWi) 

N^(0,1)  A  I  , 

^  Lyy(O)  +  2  Ih=l'YY^>  + (2d+l)J 

where  Nj(0,l)  and  N2(0,l)  denote  independent  standard  normal  random  variables.  The  result 
now  follows  immediately. 


3.  Application 

In  practice,  when  T  is  large,  Theorem  2  can  be  used  to  approximate  the  distribution  of 
Cj,.  If  all  parameters  are  known,  then  the  Theorem  can  be  applied  directly.  Otherwise,  the 

parameters  must  be  estimated  from  data.  The  parameters  that  must  be  estimated  include  the 
mean  value  function  m(T)  for  the  Poisson  process,  and  the  parameters  in  the  ARIMA(p,  d,  q) 
process,  namely,  p,  d,  q,  <Pp  tp^,  0j, ...,  0^,  and  o^.  Efficient  estimation  of  these  inn  turn 

leads  to  efficient  estimation  of  the  autocovaiiance  function  Yy^*)-  Appropriate  data  for 

accomplishing  this  are  those  listed  in  the  Appendix  for  the  asbestosis  claims.  For  illustrative 
purposes,  assume  that  the  claims  are  generated  according  to  an  homogeneous  Poisson  process. 
Then  for  this  dataset,  m(«)  would  be  estimated  by 

m(T)  =  1.63  T.  (16) 

As  previously  discussed,  the  series  of  claim  amounts  fits  an  ARIMA(0,1,1)  model 

Xt  =  \l+^^'"^t'^®l^t-l’ 

with  )i=244.13,  0j=  -.93,  and  o^=6.()4  x  10^.  The  autocovaiiance  function  of  Yj=VX^  in  this 
case  is 
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(17) 


Yy(0)  =  0^(1  +  0j),  =  0^®!’  VyCh)  =  0*  for  lhl>l. 

Thus,  (10)  and  (11)  become 

^ 244,13(1.637. 1)(L6_3T) ^ ^ ^  ^ 


2  (1.63T)- 

"t=  — t- 


=  (4.5x10") 


{(6.04  X  10*)(l  +  (-.93)^  +(2)(-.93))  +  (244.13)^(3))  (18) 

,6)t3 


so  that  in  this  example,  Cj,  is  AN(Vp  x^)  with  Vq.  and  given  as  in  (17)  and  (18).  This  can  be 

compared  to  the  approximation  that  would  be  obtained  if  the  claim  data  were  treated  naively  as 
IID.  If  this  were  done,  the  sample  mean  of  the  Xj  would  be  15098.16,  the  sample  variance 

o  2 

would  be  5.62  x  10  ,  and  the  approximation  would  be  CIj  is  AN(\|/p  ^,j,)  where 

V,j,=  15098.16(1.637)  =  24610.00  7  (19) 

=  (1.63T)(5.62  X  10*  +  (15098.16)^)  =  (1.3  x  10^)  7.  (20) 

The  discrepancy  between  (17)  and  (19),  and  between  (18)  and  (20)  is  large,  with  (19)  and  (20) 
being  of  the  wrong  order  of  magnitude  in  7. 


4.  Conclusions 

A  generalization  of  the  CPP  model  for  insurance  claim  costs  has  been  presented  and  its 
plausibility  demonstrated  using  asbestosis  claim  data  collected  from  Naval  shipyards. 
Asymptotic  distribution  theory  has  been  developed  to  allow  approximation  of  the  distribution  of 
total  claim  costs  during  [0,  7]  for  large  7.  These  approximations  and  a  numerical  example 
demonstrate  that  large  discrepancies  can  be  experienced  when  applying  the  naive  CPP  when  the 
more  general  model  using  an  ARIMA(p,d,q)  process  is  appropriate. 
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Appendix:  Tables  and  Figures 


Table  1: . Observed  versus  expected  number  of  claims 

Table  2: .  Asbestosis-related  claim  costs,  1/90  -  6/91 

Figure  1 . Cumulative  number  of  asbestosis-related  claims  versus  time, 

along  with  fitted  Sth-degree  polynomial 

Figitfe  2: . Asbestosis-related  claim  costs,  1/90  -  6/91,  time  plot 

Figure  3: . Estimated  autocorrelation  function  of  asbestosis  claim  cost 

data  (asbestosis  cost  series  differenced  at  lag  1) 

Figure  4: . Estimated  partial  autocorrelation  function  of  asbestosis  claim 

cost  data  (asbestosis  cost  series  differenced  at  lag  1) 

Figure  5: . Estimated  autocorrelation  function  of  estimated  residuals  from 

asbestosis  cost  data  fitted  to  an  ARIMA(0,1,1)  model 
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Table  1:  Observed  aaims  Versus  Expected 
Time  Interval  Observed  (O)  Expected  (E)  (0-E)^/E 


2 

62 

117 

120.814 

0.12041 

62 

71 

16 

19.953 

0.78331 

71 

132 

136 

135.116 

0.00579 

132 

145 

27 

27.534 

0.01037 

145 

195 

105 

98.055 

0.49184 

195 

222 

45 

47.070 

0.09104 

222 

244 

28 

35.278 

1.50164 

244 

298 

77 

76.380 

0.00504 

298 

351 

79 

65.320 

2.86491 

351 

450 

107 

119.842 

1.37617 

450 

490 

54 

54.553 

0.00560 

490 

520 

48 

44.331 

0.30363 

520 

544 

35 

37.362 

0.14936 

Chi-square  =  7,71  on  6  degrees  of  freedom  (6  parameters  were  estimated). 
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Table  2 

Asbestosis  Claim  Costs;  Naval  Shipyards,  January  1990  through  Jxjne  1991 


(Read  Data 

Across) 

853 

1082 

1736 

12035 

29201 

27454 

29933 

15832 

29710 

17809 

39355 

23214 

609 

10835 

44982 

2192 

18571 

93514 

24097 

561 

23975 

17117 

8782 

6365 

58995 

656 

21587 

49213 

6369 

1587 

335 

14456 

124 

1445 

137 

1014 

9848 

660 

7740 

30045 

542 

21303 

18366 

11408 

26127 

17096 

165 

455 

34596 

672 

40184 

3067 

44565 

74062 

1216 

51622 

673 

30781 

1325 

22 

828 

19393 

24764 

1004 

30319 

555 

593 

33591 

8659 

1075 

25730 

2426 

15152 

505 

375 

1768 

23327 

65 

25336 

557 

8350 

1932 

191 

2003 

237 

265 

45838 

3275 

638 

2461 

15718 

1045 

15925 

60 

250719 

432 

448 

26962 

53647 

19201 

1858 

27388 

1331 

528 

356 

24614 

30735 

457 

720 

265 

11557 

4388 

566 

44606 

26693 

266 

21380 

24891 

27628 

22 

37148 

26608 

15843 

16177 

49743 

1381 

1052 

14444 

903 

753 

5227 

695 

44009 

9353 

420 

108199 

248 

1436 

1894 

991 

5377 

29479 

60 

32700 

425 

519 

20725 

16417 

969 

22484 

245 

1257 

634 

2016 

561 

950 

385 

255 

54561 

35473 

42308 

2186 

3803 

16445 

1946 

24786 

679 

948 

17734 

68921 

24389 

578 

30277 

2563 

9511 

25944 

13553 

389 

7930 

368 

26727 

10737 

21835 

1111 
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